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ABSTRACT 

This paper presents a new numerical algorithm for the solution of large-order 
eigenproblems typically encountered in linear elastic finite element systems. 
The architecture of parallel processing is used in the algorithm to achieve 
increased speed and efficiency of calculations. The algorithm is based on the 
frontal technique for the solution of linear simultaneous equations and the 
modified subspace eigenanalysis method for the solution of the eigenproblem. 
Assembly, elimination, and back-substitution of degrees-of-f reedom are per- 
formed concurrently by using a number of fronts. All fronts converge to 
and diverge from a predefined global front during elimination and back- 
substitution, respectively. In the meantime, reduction of the stiffness and 
mass matrices required by the modified subspace method can be completed during 
the convergence/divergence cycle, and an estimate of the required eigenpairs 
can be obtained. Successive cycles of convergence and divergence are repeated 
until the desired accuracy of calculations is achieved. The advantages of 
this new algorithm in parallel computer architecture are discussed. 
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GENERALIZED EIGENPROBLEM 


New parallel algorithm for the solution of large-scale eigenproblems in 
finite element applications 

Assumptions 

(1) Linear elastic finite element models 

(2) n lower order eigenpairs are required, i.e., < ••• 

(3) [K] is positive-definite 

(A) [M] is semipositive definite 


[K] [#] = [Ml [#] [Q] 

N - DEGREES OF FREEDOM 
REQUIRED n EIGENPAIRS, n < N 
[K] POSITIVE-DEFINITE 
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PARALLEL ARCHITECTURE 


Consider a parallel computer with (m+1) processors (tasks). 

Designate the first processor as a global processor (task). 

Designate the remaining m-processors as domain processors (tasks). 

A finite element model can be divided into a number of domains equal to 
A star architecture (or tree) is the first to be investigated. 


FINITE ELEMENT MODEL STAR ARCHITECTURE 

SUBDIVIDED INTO m DOMAINS 




CD-88-31669 



MAJOR COMPUTATIONAL TASKS 


Three major steps of large computational requirements 

(1) Creation of element stiffness and mass matrices 

(2) Extraction of a set of eigenpairs 

(3) Solution of a set of simultaneous linear equations 


The merits of selecting the modified subspace method for step 2 and the 
frontal solution for step 3 are discussed in the next viewgraphs. 


[K][*] = [M] [$] [0] 

(1) CREATION OF K e AND M e 

(2) EIGENSOLUTION (MODIFIED SUBSPACE) 

(3) EQUATION SOLVER (FRONTAL SOLUTION) 
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MODIFIED SUBSPACE METHOD 


The Modified Subspace Method iterates simultaneously for a subset of eigenpairs 
of the generalized eigenproblem: 

(1) Let [V]i be n starting eigenvectors. 

(2) Operate on each [V]^ as follows: 

[ <i = [K] -1 [M][V] ft = [K] -l [B]^, H - 1,2,3, . . . 

•ff 

(3) Modify [V]^ ^ to increase convergence rate by one third on average 

[VI Li * K.i - v vl !i 

where = 0 for 2=1 and 2 > 11 

13^ = 0.5 (l+r^j/co^ 1 < 2 < 11 

r^ are the interval points of the 11th order Labatoo rule [-1, 1] 

(4) Project K and M onto the required subspace. 

(5) Solve the auxiliary eigenproblem to obtain [Q]^ and [Q]^. 

(6) An improved set of eigenvectors [V]^ +1 can be obtained. 

2 

(7) Test for convergence on go . Repeat steps 2 to 6 until desired accuracy 

is achieved. n 


[Ki; + i=E[vi7 + i[K]m; + i 

[Mj; +1 =E[vi7 + itM]m ( ‘ + i 

6 


AUXILIARY EIGENPROBLEM 
[K]; +1 [Q), + 1 = [Mj; +1 [0], +1 [Ul t+1 


IMPROVED EIGENVECTORS 

w?+i=w;j 1 ra <+1 
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RATE OF CONVERGENCE 


Rate of convergence of the modified subspace is 33 percent faster on aver- 
age compared to the classical subspace method. 

The figure shows typical behavior. 

Most computations are performed on an element-by-element basis. 


2 

BASIC SUBSPACE < ■ 

- 2 
w n + 1 


MODIFIED SUBSPACE <__J_ 

- 2 
“>n + 1 


2 ~ n + 1 
1-^fW f 


TOLERANCE 

LEVEL, 


tol 



CONVERGENCE OF X 2 AND X 4 FOR SQUARE PLATE 
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FRONTAL SOLUTION 


(1) Gauss elimination technique 

(2) Underlying philosophy based on processing of elements one by one 

(3) Simultaneous assembly and elimination of variables 

(4) The optimum frontal width at most equal to the optimum bandwidth 

(5) Numbering of nodes - no impact on optimality; numbering of elements - 
important to minimize the frontal width 

(6) More efficient for solid elements and elements with midside nodes 

(7) Requires a prefront to determine last appearance of each node 

(8) Lends itself to parallel solutions 


WITHIN EACH DOMAIN 


k u 



^is^sj 


'ss 



«- b: 


iq 





'ss 
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DOMAIN PROCESSING 


Multitasking on the Cray supercomputer provides tools for implementing the 
frontal solution concurrently within a number of domains. Assembly and elimi- 
nation for the i-th domain is assigned to a task (processor). Simultaneous 
assembly and elimination of variables within the domains is performed in paral- 
lel tasks until the domain fronts reach their respective global fronts. How- 
ever, it is instructive to analyze the set of simultaneous equations for the 
i-th domain assuming that the domain stiffness matrix and right-hand sides are 
fully assembled before Gauss elimination is performed. 


For domain i 


[KHV] ft+1 - [B] a 


Elimination gives 


Vd * Vf ■ B d 


k f v f - b f 


at iteration i 


where U, upper A matrix for domain i 
d 

* 

V, variables within domain 1 
d 

& 

V„ variables along global front of domain i 
F 

B, and B„ are right-hand sides for domain variables and global fronts, 
d F 

respectively. 


/- GLOBAL FRONT 
/ FOR DOMAIN! 



Njd 


*d 


»d 


B d 

0 


k f 

. 


II 

b f 


i-th DOMAIN 


UdVj + K d Vf = B d 
K F Vf = B F 
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GLOBAL FRONTS 


m 

K FF ' T K f 
m 

b ff b f 

where m = total numbers of domains 

k ff v ff = b ff 

Solution for Vpp is then obtained by using the frontal solution on the global 

fronts. Since Vpp is a superset of all V* * back-substitution for V 
follows concurrently within the domains. 


m 

Kpp = E Kp 
m 

b ff = eb f 

WHERE m = TOTAL NUMBER OF DOMAINS 
K FF V FF = B FF 
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IMPLEMENTATION 


Successful implementation of the new parallel algorithm depends on 

(1) Maximizing the efficiency of communication links between the global 
task and the domains 

(2) Minimizing sequential computational steps 

(3) Multithreaded I/O 


















CONCLUSIONS 


Parallel solution method for eigenvalue extraction for linear elastic finite 
element models has been successfully implemented on the Cray supercomputer by 
using the multitasking environment. Preliminary results are encouraging and 
extensive testing of the new algorithm is currently progressing. The new algo- 
rithm enhances the speed-up of similar sequential solution methods. Both the 
frontal method for the solution of the set of simultaneous equations arising 
in finite element problems and the modified subspace method for the solution 
of a subset of eigenpairs offer a new algorithm which has been efficiently 
parallelized. The parallel tasks are associated with recognizable finite ele- 
ment domains rather than dissected blocks of abstract equations. Moreover, the 
complexity of data management and data flow normally associated with parallel 
solution methods is avoided in this new algorithm. 
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